Multilingual Part-of-Speech Tagging: Two Unsupervised Approaches
Similar resources
Multilingual Part-of-Speech Tagging: Two Unsupervised Approaches
We demonstrate the effectiveness of multilingual learning for unsupervised part-of-speech tagging. The central assumption of our work is that by combining cues from multiple languages, the structure of each becomes more apparent. We consider two ways of applying this intuition to the problem of unsupervised part-of-speech tagging: a model that directly merges tag structures for a pair of langua...
Unsupervised Approaches to Part-of-Speech Tagging
There are numerous strategies for designing POS taggers for a specific language: rule-based, probabilistic, and hybrid. We focus on unsupervised approaches, i.e. learning tagging probabilities from unlabeled text (Figure 2). This has the potential to scale quickly to any language, as it does not require copious amounts of labeled text (supervised training data) or an exhaustive list of handcoded ru...
Unsupervised Part-of-speech Tagging
Different approaches have been taken in order to solve the part-of-speech tagging problem. Several methods for unsupervised tagging have obtained good accuracies in practice. The approach taken by Brill [Bri95] obtains results comparable to the best existing taggers. In this paper we explore the details of this unsupervised part-of-speech tagger and we present a comparison to the Xerox tagger, wh...
Unsupervised Part of Speech Tagging for Persian
In this paper we present a rather novel unsupervised method for part of speech (hereafter POS) disambiguation, which has been applied to Persian. This method, known as the Iterative Improved Feedback (IIF) Model, is a heuristic that takes as input only a raw corpus of Persian together with all possible tags for every word in that corpus. During the process of tagging, the algorithm passes through sever...
UnsuParse: unsupervised Parsing with unsupervised Part of Speech Tagging
Based on simple methods such as observing word and part of speech tag co-occurrence and clustering, we generate syntactic parses of sentences in an entirely unsupervised and self-inducing manner. The parser learns the structure of the language in question based on measuring ‘breaking points’ within sentences. The learning process is divided into two phases, learning and application of learned k...
Journal
Journal title: Journal of Artificial Intelligence Research
Year: 2009
ISSN: 1076-9757
DOI: 10.1613/jair.2843